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ABSTRACT 

Cyclodipeptide synthases (CDPSs) belong to a newly 
defined family of enzymes that use aminoacyl- 
tRNAs (aa-tRNAs) as substrates to synthesize the 
two peptide bonds of various cyclodipeptides, 
which are the precursors of many natural products 
with noteworthy biological activities. Here, we 
describe the crystal structure of AlbC, a CDPS 
from Streptomyces noursei. The AlbC structure 
consists of a monomer containing a Rossmann- 
fold domain. Strikingly, it is highly similar to the 
catalytic domain of class-l aminoacyl-tRNA 
synthetases (aaRSs), especially class-lc TyrRSs 
and TrpRSs. AlbC contains a deep pocket, highly 
conserved among CDPSs. Site-directed mutagen- 
esis studies indicate that this pocket accommo- 
dates the aminoacyl moiety of the aa-tRNA 
substrate in a way similar to that used by TyrRSs 
to recognize their tyrosine substrates. These 
studies also suggest that the tRNA moiety of the 
aa-tRNA interacts with AlbC via at least one patch 
of basic residues, which is conserved among CDPSs 
but not present in class-lc aaRSs. AlbC catalyses its 
two-substrate reaction via a ping-pong mechanism 
with a covalent intermediate in which L-Phe is 



shown to be transferred from Phe-tRNA to an 
active serine. These findings provide insight into 
the molecular bases of the interactions between 
CDPSs and their aa-tRNAs substrates, and the cata- 
lytic mechanism used by CDPSs to achieve the non- 
ribosomal synthesis of cyclodipeptides. 



INTRODUCTION 

Cyclodipeptide synthases (CDPSs) constitute a family of 
peptide-bond forming enzymes that use aminoacyl-tRNAs 
(aa-tRNAs) as substrates to form various cyclodipeptides 
(1). They are associated with cyclodipeptide-tailoring 
enzymes in biosynthetic pathways dedicated to the synthe- 
sis of diketopiperazines (DKPs) (2,3), a large class of 
natural products mainly produced by microorganisms. 
The CDPS family includes at least eight identified 
members found in various bacterial phyla. All members 
are similar in size (239-289 residues). They display a 
moderate sequence similarity, but conserved residues are 
found in 13 positions. 

CDPSs use previously activated amino acids in the form 
of aa-tRNAs and catalyse the formation of the peptide 
bonds of the DKP scaffold (1) (Figure 1). Only two 
other enzymes families also use loaded tRNA as substrates 
to form peptide bonds (4-7): Fem aminoacyl transferases 
form cross-bridges in peptidoglycan biosynthesis (8), and 
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Figure 1. The reaction catalysed by CDPSs. CDPSs use activated amino acids, in the form of aa-tRNAs, as substrates to catalyse the formation of 
cyclodipeptides, also known as DKPs. 



aa-tRNA protein transferases modify the N-terminal ex- 
tremity of proteins for subsequent degradation (9). Fern 
aa-transferases and aa-tRNA protein transferases thus 
catalyse similar chemical reactions, in which the amino 
acid loaded on an aa-tRNA donor is transferred to the 
N-terminal amino group of a peptide or a protein 
acceptor. They both also belong to the same 
structural GCN5-related iV-acetyltransferase (GNAT) 
protein superfamily (9-11). CDPSs differ from Fern 
aa-transferases and aa-tRNA protein transferases in that 
they use two aa-tRNAs to form two peptide bonds, 
whereas transferases use only one aa-tRNA to form only 
one peptide bond. CDPSs and transferases have also dif- 
ferent structural organizations. Indeed, the crystal struc- 
ture of a CDPS, namely Rv2275 of Mycobacterium 
tuberculosis, was recently determined (12), revealing that 
Rv2275 is a homodimer, each monomer being structurally 
related to class-Ic aminoacyl-tRNA synthetases (aaRSs) 
(13-16). Rv2275 uses two tyrosyl-tRNA Tyr (Tyr- 
tRNA Tyr ) as substrates to produce cyclo(L-Tyr-L-Tyr) 
(cYY) (1). 

Here, we solved at 1.9 A resolution the crystal struc- 
ture of Streptomvces noursei AlbC, a CDPS that 
uses mainly phenylalanyl-tRNA phe (Phe-tRNA phe ) and 
leucyl-tRNA Leu (Leu-tRNA Leu ) as substrates to synthe- 
size cyclo(L-Phe-L-Leu) (cFL). The AlbC structure 
consists of a monomer. It contains a Rossmann-fold 
domain and shares similarity with the catalytic domain 
of class-Ic aaRSs, as previously reported for Rv2275 
(12). The reaction catalysed by Rv2275 was suggested 
to proceed via a ping-pong kinetic mechanism with a 
covalent intermediate in which L-Tyr is transferred 
from Tyr-tRNA Tyr to an active serine, but such a 
transfer was not conclusively demonstrated (12). Here, 
we showed that the serine residue is covalently bound 
to the aminoacyl transferred during the catalytic cycle, 
demonstrating that CDPSs use a mechanism involving a 
covalent aminoacyl-enzyme intermediate. Moreover, we 
also showed that the interaction between AlbC and its 
aa-tRNA substrates is driven by the binding of the sub- 
strate aminoacyl moiety in a deep pocket — similar to the 
tyrosine-binding pocket of TyrRSs — and the interaction 
of the tRNA moiety with at least one patch of basic 
residues. This provides clues about the molecular bases 



of the interactions between CDPSs and their aa-tRNAs 
substrates. 



MATERIALS AND METHODS 

Mutagenesis and purification of AlbC and variants 

The expression plasmid encoding AlbC was constructed 
from the vector pQE60 (Qiagen). The resulting plasmid 
encodes the full-length AlbC (residues 1-239) with the 
C-terminal addition of two residues (RS) followed by a 
His 6 tag (1). The desired variants of AlbC were created 
via PCR mutagenesis according to the QuikChange 
method (Stratagene). Thirty-two variants were con- 
structed (28 variants with a single mutation, and four 
variants with a double mutation). For 12 variants, the 
mutations are localized in the putative catalytic pocket 
(L33Y/L185D, G35A, S37A, S37C, Y178A, Y178F, 
E182A, E182Q, E182D, L200N, Y202A, Y202F), and 
for the remaining variants, the mutations are outside 
this pocket (H31A, W54A, F59A, G79A, R87A, K90A, 
R91A, K94A, R97A, R98A, R99A, R98A/R99A, R101A/ 
R102A, S120A, Y128A, C149A, P184A, R214A/R215A, 
R220A, R231A). The C-terminal His 6 -tagged proteins 
were produced and purified as described previously 
(1,17). Protein production was followed by SDS-PAGE 
analyses or western blotting according to standard proto- 
cols, and His-tagged proteins were visualized using 
alkaline phosphatase-conjugated anti-His antibody and 
BCIP/NBT solution (Sigma). 

Crystallization and structure determination 

Small crystals of AlbC (80 x 10 x 10 um 3 ) were grown by 
vapour diffusion in sitting drops at 17°C by mixing 2 ul of 
protein sample at 4mg/ml with 2ul reservoir solution. 
First crystalline precipitates were obtained by a reverse 
screening approach (18). By screening different pHs, 
crystals were obtained with reservoir containing 1.7 M 
Na/K phosphate at pH 6.0. A screening of different addi- 
tives showed that the presence of reducing agents 
improves crystals size and the addition of a large excess 
of dithiothreitol (DTT) (100 mM) gave the best results. 
These crystals were cryocooled with different cryoprotect- 
ants, but they did not diffract better than 3.5 A. Larger 
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crystals (300 x 40 x 40 um ) were obtained by a batch 
method under silicon oil and microseeding from solutions 
of crushed AlbC crystals (19). The native crystals were 
cryocooled with 35% glycerol and diffracted up to 1.9 A 
on Proximal (Soleil, France) and ID29 (ESRF, France) 
beamlines. The crystal structure was determined with 
selenomethionine-substituted AlbC by MAD (multi- 
wavelength anomalous dispersion) using phase informa- 
tion derived from anomalous scattering data collected at 
the Se K-edge. The crystals belong to the space group 14 
with a single copy of the protein in the asymmetric unit. 
The positions of four selenium atoms were determined by 
SHELXD (20). Experimental phases were calculated in 
SHARP (21), and the tracing of an initial model in 
COOT that was subsequently completed by automatic 
building in ArpwARP (22) using the AlbC native data set. 

The final model was refined in REFMAC5 (23) to R/ 
Rfree values of 0.186/0.236 at 1.9 A resolution with one 
TLS parameter for the protein chain. In the final model, 
residues 1-27, 217, the eight C-terminal residues 
[RS(His) 6 ] introduced during cloning, and the side chain 
of residue R214 were not visible in the electron density 
map, they were not included in the crystallographic 
model. The MolProbity score (24) for the refined model 
is 1.91, in the 75th percentile of structures refined at com- 
parable resolution. Almost all residues (97.6%) were in 
favoured regions of the Ramachandran plot, and 
100.0% were in allowed regions. 

In the final electron density, remaining peaks that could 
not be attributed to water were modelled with three mol- 
ecules of phosphate and a molecule of cyclo-dithiothreitol 
(cDTT), which were present in the crystallization solution 
as a precipitant agent and an additive, respectively (see 
above, cDTT comes from the oxidized fraction of the 
DTT used; Supplementary Figure SI). This observation 
suggests that the large quantity of DTT used during 
crystal optimisation served both to maintain AlbC under 
its reduced state, and to feed crystals with cDTT present in 
equilibrium with reduced DTT. Figures were prepared 
with PyMOL (PyMOL Molecular Graphics System, 
Version 1.2r3pre, Schrodinger, LLC). The programme 
Superpose (25) was used for structural alignment and 
Areaimol (26) for accessible surface area measurements. 
The coordinates and structure factors of S. noursei AlbC 
have been deposited in the Protein Data Bank (PDB) 
(accession code 30QV). 

Cyclodipeptide-synthesizing activity assays 

Assays with supernatants of cultures of Escherichia coli 
cells expressing the genetic constructs encoding AlbC 
wild-type or an AlbC variant were analysed by LC-MS/ 
MS as described previously (1). The in vivo activities are 
reported as percentage of the wild-type activity. Error bars 
represent the standard deviation for at least two independ- 
ent experiments. 

Detection and identification of the covalent 
aminoacyl-enzyme intermediate 

To detect the covalent aminoacyl-enzyme intermediate, 
purified AlbC or relevant variants were incubated with 



[ 3 H]Phe-tRNA phe , the proteins were separated by SDS- 
PAGE before being transferred on a polyvinylidene 
fluoride (PVDF) membrane, which was analysed with a 
Beta-Imager™ 2000 from Biospace (Paris, France). 
Protein concentrations were quantified by UV spectros- 
copy. The tritiated substrate was obtained as follows: 
a 100 liI reaction mixture containing 3.2 uM E. coli 
tRNA Phe (Sigma), 0.32 mM Phe, 0.9 uM [ 3 H]Phe 
(Phenylalanine, L [2,3,4,5,6- 3 H], Perkin-Elmer, 
HOCi/mmol), 2mM ATP, 1 mM DTT, 20 mM MgCl 2 , 
50 mM KC1, luM E. coli PheRS (PheRS £l ) in 50 mM 
Hepes KOH, pH 7.0 was incubated at 30°C for 30min. 
Then, AlbC or AlbC variant was added at a final concen- 
tration of 1 uM. After 20 s incubation, the reaction was 
quenched by the addition of 5 ul Laemmli loading buffer 
[final concentration, 0.1% (w/v) bromophenol blue, 
2% (w/v) SDS, 10% (v/v) glycerol, 50 mM Tris-HCl, 
pH 6.8 and lOOmM DTT] followed by heating the 
samples 2min at 50°C. These samples were immediately 
resolved by SDS-PAGE electrophoresis followed by 
blotting. Blotting of proteins onto a PVDF membrane 
was achieved using a semi-dry transfer blot apparatus 
(Bio-Rad). After transfer (400 mA, 30min), the 
membrane was dried before radioactivity analysis with 
the Beta-Imager™ 2000. This apparatus allows an 
absolute counting of the tritium R particles, with a detec- 
tion threshold of 0.007 cpm/mm for tritium. 

To identify the residue acylated and the nature of the 
acylation, the aminoacyl-enzyme intermediate was 
produced by a direct assay using purified Phe-tRNA phe . 
The aminoacylation reaction was performed as described 
above. Charged Phe-tRNA phe was recovered by phenol/ 
chloroform extraction and ethanol precipitation [the 
loading of tRNA phe with Phe was estimated to be 70%, 
as previously described (1)]. Two micrograms of 
Phe-tRNA phe were mixed with 3 u.g of AlbC (or variant) 
in 20 ul of the reaction buffer containing 50 mM Hepes- 
KOH pH 7.0 and 50 mM KC1. After 20 s of incubation, a 
5 ul-aliquot was removed and submitted to trypsin diges- 
tion: the withdrawn aliquots were diluted up to 10 ul in 
50 mM NH 4 Ac0 2 , pH 8.0 and 1 ul of trypsin (400ng/ul 
porcine trypsin, Promega) was added. Digestion was per- 
formed at 50°C up to lOmin. Sample digests were then 
acidified by 1.25% trifluoroacetic acid (TFA) in water, 
desalted and concentrated by C18 ziptip and spotted 
with 4-HCCA matrix solution (lOmg/ml in 49.8/49.8/ 
0.4: CH 3 CN/H 2 0/TFA: v/v/v) on MALDI plate. MS 
and MS/MS spectra were registered using a 4800 
MALDI-TOF/TOF mass spectrometer (Applied 
Biosy stems, Foster City, USA) in a positive reflectron 
mode. Each MS spectrum was the result of 1000 laser 
shots and calibration was applied by using trypsin autoly- 
sis fragments as internal standards. MS/MS analyses were 
performed under post-source decay (PSD) mode. Each 
MS/MS spectrum was the result of 10000 shots. The 
sequence of the trypsic fragments was identified by using 
Data Explorer® processing software (Version 4.9, Applied 
Biosystems) to assign the potential chemical modifications 
of the serine or cysteine residues. 
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RESULTS 

Crystal structure of S. noursei AlbC 

We produced AlbC in E. coli, purified it to homogeneity 
(1) and crystallized the purified AlbC. We solved the 
crystal structure of AlbC by the multi-wavelength anom- 
alous dispersion (MAD) method using selenomethionine- 
labelled protein. The native structure was refined to a final 
R-factor/R-free of 18.6/23.6% at a resolution of 1.9 A 
(Table 1). The crystal structure of AlbC is a monomer 
that presents a compact oc/p structure (Figure 2). It is 
composed of a central P-sheet with five parallel p 
strands (p3-p7) surrounded by ten a helices (a2-a9) 
(Figure 2A). The AlbC structure contains a Rossmann- 
fold domain (27) formed by strands P3-P5, helix oc2 and 
the C-terminal part of helix a4. This domain is followed 
by a helical region composed of three main helices (oc5-a7) 
that pack against the Rossmann fold. The structure ends 
with two p strands (P6 and P7) that extend the Rossmann 
P-sheet and two small connecting helices (oc8 and a9). 
AlbC residues 1-27 were not visible in the electron 
density map. Crystal packing analysis showed the 
presence of large cavities near the residue 28 that are com- 
patible with the presence of a flexible N-terminal extension 
(Supplementary Figure S2). Some regions of AlbC were 
visible in the electron density map, but they present higher 
B-factors than the rest of the protein (Supplementary 
Figure S3). These regions contain the residues 38-46 in 
loop p3-a2, the residues 105-111 in loop a4-p5, the 



residues 215-220 in loop a8-p7, and the C-terminal 
residues 236-239. 

A striking feature of o the structure of AlbC is the 
presence of a deep (14 A long) and narrow (diameter 
ranging from 4 to 7 A) surface-accessible pocket, which 
is delineated by helices oc6, a7, a8 and P-strands P3, P4 
and p6. The buried part of the pocket in AlbC is bordered 
mostly by side chains of nine hydrophobic residues: L33, 
V65, LI 19, L185, F186 and L200 at its deepest point and 
V67, Ml 52 and 1208 on the edges (Figure 2B). Four add- 
itional residues contribute to define the buried part of the 
pocket, G35, El 82 and the main chains of Y202 and 
H203. Near the residue G35, the pocket presents a small 
bulge bordered by the hydrophobic residues 134, F43, 
L51, C201 and V230. Residues P207 and VI 56, the carb- 
oxylate moiety of El 82 and the side chains of four polar 
amino acids — S37, N40, Y178, T206 — define the entrance 
of the pocket (Figure 2B). Additional electron density was 
attributed to a molecule of phosphate and to a molecule of 
cDTT, which are located at the entrance and at the 
bottom, respectively, of the pocket (Figure 2B; 
'Materials and Methods' section). The phosphate 
molecule, probably present as a H2PO4 ion at the pH 
used for crystallization (pH = 6), is within hydrogen 
bond distances of residues Y178 and El 82, and to two 
water molecules. The cDTT is surrounded by hydrophobic 
residues in the most buried part of the pocket and its 
oxygens are within hydrogen bond distances of phosphate 
and water molecules. 



Table 1. Data collection, phasing and refinement statistics 





Native 




SeMet Crystal 1 




SeMet Crystal 2 


Data collection 
Space group 
Cell dimensions 
a, b, c (A) 

a, P, y (°) 


14 

97.2, 97.2, 45.5 
90.0, 90.0, 90.0 




14 

97.9, 97.9, 45.5 
90.0, 90.0, 90.0 




14 

98.0, 98.0, 45.5 
90.0, 90.0, 90.0 






Peak 


Inflection 


Remote 


Peak 


Wavelength 
Resolution (A) 

^sym Or ^ m erge 

Ijal 

Completeness (%) 
Redundancy 


0.9793 

48.6-1.90 (2.00-1.90) 
8.70 (61.5) 
13.0 (3.10) 
99.7 (100.0) 
7.4 (6.9) 


0.9792 

69.0-2.70 (2.70-2.85) 
8.10 (28.6) 
13.8 (4.40) 
99.3 (98.9) 
4.0 (3.8) 


0.9794 

69.0-3.00 (3.16-3.00) 
9.80 (36.6) 
11.1 (3.50) 
99.1 (99.8) 
3.7 (3.9) 


0.9757 

69.0-3.50 (3.69-3.50) 
9.80 (24.2) 

11.4 (5.50) 

98.5 (99.7) 
3.4 (3.6) 


0.9792 

69.0-3.00 (3.16-3.00) 
9.10 (40.6) 
12.9 (3.80) 
99.9 (100.0) 
5.6 (5.7) 



Refinement 



Resolution (A) 


1.90 


No. reflections 


15963 


-^work/ ^I'ree 


18.6/23.6 


No. atoms 




Protein 


1621 


Ligand/ion 


23 


Water 


107 


B-factors (A 2 ) 




Protein 


29.3 


Ligand/ion 


55.8 


Water 


36.2 


Rmsd 




Bond lengths (A) 


0.006 


Bond angles (°) 


0.793 



Values in parentheses are for highest resolution shell. 



Nucleic Acids Research, 2011, Vol. 39, No. 10 4479 




B 





Figure 2. Crystal structure of S. noursei AlbC and comparison with that of M. tuberculosis Rv2275. (A) Overall structure of AlbC (region 28-236) in 
cartoon mode, with a helices, P-strands, and loops coloured in green, yellow and grey, respectively. AlbC secondary-structure elements are numbered 
according to Rv2275 numbering (12). The Rossmann-fold domain is constituted by helix oc2, the C-terminal part of helix 4 and the P-sheet < (33— P5) 
that is extended by two additional strands P6 and P7. A helical region (in light green), composed of helices oe5, a6 and txl, packs against the 
Rossmann fold. (B) Structure of the AlbC surface-accessible pocket. Residues that delineate the AlbC pocket are shown in ball and stick represen- 
tation. The most buried part of the pocket is mainly constituted of hydrophobic residues (coloured in green). The entrance of the pocket is mainly 
delineated by polar residues (coloured in orange). The boundaries of the pocket, calculated using the program CAVER (58), are defined by a 
semi-transparent grey surface. The molecules of phosphate and cDTT modelled in the electron density are shown in orange and dark green, 
respectively. (C) Superimposition of the structures of AlbC and Rv2275 in cartoon representation. AlbC and Rv2275 are coloured in green and 
magenta, respectively. Both proteins superpose well. Most significant structural differences concern N- and C-terminus regions, and also relative 
positions of helices a5, a6 and a8. (D) Superimposition of the surface-accessible pockets of AlbC and Rv2275, with an orientation similar to that 
shown in Figure 2C. Residues represented in ball and stick are essential for CDPS activity. AlbC and Rv2275 residues are indicated by A and R 
letters, respectively. The superimposition highlights the structural deviations between loops p3-ot2 and (36— a2, and residues Y202 A and Y253 R located 
at the entrance of the active sites. 



The three-dimensional (3D) superimposition of AlbC 
and Rv2275 (PDB id: ljlu) [root mean square deviation 
(rmsd) value of 2.27 A over 192 Ca] revealed that the two 
enzymes superimpose well despite sequence divergence 
(AlbC shares 25.9 % identity and 40.6 % similarity with 
Rv2275; Figure 2C). Almost all secondary-structure 



elements superimpose well, in particular the Rossmann- 
fold domain and the two additional strands P6 and P7. 
However, there are some major differences between the 
two enzymes. First, the region 50-77 of Rv2275 is 
structured (strands pi, (32 and helix al) while the corres- 
ponding region of AlbC (1-27) is not observed in the 
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electron density maps. At opposite, two regions of AlbC, 
the loop a.6-al and the C-terminal helix a9 are structured 
in AlbC, and the corresponding regions are not observed 
for Rv2275. Second, the axes of helices oc5, a6 and a8 are 
tilted by some degrees, so that residues located at these 
helices extremities superimpose poorly. Finally, important 
deviations are observed between loops, in particular loops 
P3-a2, o4-P5, (36-a8 and a8-p7. 

Both enzymes possess a similar surface-accessible 
pocket that contains six residues conserved in the CDPS 
family. Four of these residues, G35 (AlbC numbering) 
belonging to the strand P3, and Y178, E182 and 
PI 84 belonging to the helix a7, are remarkably well 
superimposed. The two other conserved residues S37 
and Y202, respectively, belong to loops |33-a2 and P6- 
oc8 that have different conformations as mentioned 
above. In particular, the side chain of Y202 adopts a con- 
formation directed towards solvent in AlbC whereas it is 
directed towards the active site entrance and residue S37 in 
Rv2275 (Figure 2D). 

Structural similarity of AlbC with the catalytic domain 
of the class-I aminoacyl-tRNA synthetases 

We used the Dali server (28) to compare the structure 
of AlbC to protein structures in the PDB. Due to the 
Rossmann-fold domain, AlbC was found to share struc- 
tural similarity with more than 300 proteins with a high 
Z-score (Z>4) (Supplementary Table SI). Most of these 
proteins are class-I aaRSs, enzymes that catalyse the acti- 
vation of amino acids and their transfer to cognate tRNAs 
to form aa-tRNAs. The 120 solutions with the highest 
Z-scores, from 10.7 to 7.3, were all obtained with the 
closely related class-Ic TyrRSs and TrpRSs, as previously 
reported for Rv2275 (12). The greatest similarity is with 
the archaeal Archaeglobus fulgidus (PDB id: 2cyb) and 
Methanococcus jannaschii TyrRSs (TyrRS^) (PDB id: 
lzh6, ljlu), and with the eukaryotic Entamoeba histolytica 
TrpRS (TrpRSjsa) (PDB id: 3hzr) (Supplementary Table 
SI). The 3D superimposition of AlbC and TyrRS^ (rmsd 
value of 3.5 A over 160 Co) (Figure 3) or TrpRS £ /, (rmsd 
value of 3.0 A over 163 Ca) (Supplementary Figure S4) 
revealed that AlbC superimposes well with the 
N-terminal catalytic domains of the two aaRSs, despite 
low-sequence similarity (AlbC shares 15.5 and 15.8% 
similarity with the catalytic domains of TyrRS M y and 
TrpRS^/,, respectively). Note that these enzymes possess 
C-terminal tRNA-binding domains whereas AlbC does 
not. The catalytic domains of each of the two aaRSs are 
composed of a Rossmann-fold domain and a connective- 
polypeptide 1 (CP1) domain that are both found in AlbC. 
The overall position of the two domains is conserved 
between AlbC and the two aaRSs and is mediated by sub- 
stantial conservation of residues at the interface between 
the two domains. Most secondary-structure elements are 
well conserved, and the central P-sheets are well 
superimposed, but the helices that pack against these 
sheets present slight differences in length and positioning 
(Figure 3; Supplementary Figure S4). Moreover, the 
pocket in AlbC and the aminoacyl binding pockets in 
the two aaRSs are positioned similarly relative to the 



Catalytic 




Figure 3. Superimposition of the structures of AlbC and TyrRS M; in 
complex with its L-tyrosine substrate (PDB id: ljlu). The two enzymes 
are shown in cartoon mode, and L-tyrosine is in ball and stick coloured 
in orange. The Rossmann-fold and the CP1 domains of TyrRSs,- are 
coloured in dark and light blue, respectively, and the corresponding 
domains of AlbC are coloured in dark and light green, respectively. 
The Rossmann-fold and CP1 domains of the two proteins have similar 
rmsd values (3.25 A over 96 Cot and 3.26 A over 53 Ca, respectively). 
TyrRSM/ possesses two additional regions not present in AlbC: a 
C-terminal domain involved in tRNA-binding and anticodon recogni- 
tion coloured in grey, and a short N-terminal region coloured in light 
blue. 



P-sheet forming part of the Rossmann-fold domain 
(29-33). 

However, there are some major differences between 
AlbC and the two aaRSs. The hydrophobic regions of 
the CP1 domains of TyrRS or TrpRS involved in the 
homodimerization of these enzymes (34,35) are not 
found in AlbC. In addition, when we mimicked this asso- 
ciation for two AlbC monomers by superimposing them 
on the crystal structure of TyrRSM,-, we did not find any 
complementary dimer interface, and we observed major 
steric hindrances between helices oc5, a6 and a7 of each 
AlbC monomer (Supplementary Figure S5A). However, 
the crystal structure of Rv2275 was obtained as a 
homodimer. But, the Rv2275 residues that make up the 
interface belong to other secondary-structural elements 
than those involved in the homodimerization of the two 
aaRSs (12). Noted that these residues are not conserved in 
CDPSs (Figure 4 and Supplementary Figure S5B). All 
these observations are consistent with AlbC being found 
as a monomer both in solution (1) and in crystal forms. 
Another major difference between CDPSs and class-I 
aaRSs is that CDPSs do not have the two consensus 
motifs that are conserved in class-I aaRSs and are 
involved in ATP binding (36-38) (Figure 4). 

The pocket of AlbC, a binding site for the aminoacyl 
moiety of the substrate 

Using 3D superimposition, we compared the pocket in 
AlbC to the conserved amino acid-binding pockets in 
TyrRS M/ and TrpRS i/7 (29-33). For example, the AlbC 
pocket was compared to that of the TyrRS M/ complexed 
with its tyrosine substrate (Figure 5), which forms a 
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Figure 4. Structure-based sequence alignment of six CDPSs and three TyrRSs from the three primary kingdoms. The secondary-structural elements 
of AlbC and the TyrRS M/ are indicated above and below the alignment, respectively. The Rossmann-fold and the CP1 domains of AlbC are shown 
in dark and light green, respectively. The N-terminal region, the Rossmann-fold domain, and the CP1 domain of the TyrRS^, are shown in yellow, 
dark and light blue, respectively. The core regions of AlbC and the TyrRS^, displaying greatest structural similarity (rmsd value of 2.36 A over 116 
Cos) are shown with a gray background. The residues conserved among CDPSs, and those conserved among TyrRSs are indicated by a red 
background. Black stars above the AlbC sequence indicate the residues that were modified by site-directed mutagenesis in this study. The basic 
residues of CDPSs, which define a patch of positively charged residues belonging to the helix a4, are indicated by an orange background. The 
residues that are disordered in the crystal structure of AlbC and Rv2275 are shown in grey. The residues of TyrRSs previously shown to be involved 
in tyrosine recognition are indicated in dark blue; the position of the two clusters involved in acceptor stem recognition (41) are indicated by pink 
lines below the TyrRSs sequences, and the residues involved in this recognition are indicated by a pink background. Black lines below the TyrRSs 
sequences indicate the class I signature motifs, HIGH and KMSKS, which are involved in ATP binding. Rv2275 residues that are involved in the 
interface of the dimer are shown with a green background. Protein sequence alignments were done according to structural alignments for AlbC and 
TyrRSs and were corrected manually for CDPS based on previously described alignments (1). Two CDPSs are not mentioned, namely YvmC from 
Bacillus thuringiensis and YvmC from B. licheniformis, because they are very close to YvmC from B. subtilis. Abbreviations: Bsub, B. subtilis; BacSt, 
Geobacillus stearothermophilus; MetJa, M. jannaschii. 



hydrogen bond network with the conserved residues Y32, 
Y151, Q155, D158 and Q173, and interacts with the 
conserved residue G34 (29-31) (PDB id: ljlu) (Figure 
5A). The two pockets are positioned similarly relative to 
the Rossmann-fold. Each pocket contains some residues 
that are highly conserved in the corresponding family and 



that are remarkably well superimposed: the AlbC residues 
G35, Y178 and E182 superimpose with the TyrRS^- 
residues G34, Y151 and Q155, the rmsd values on all 
atoms being 1.30, 1.29 and 1.95 A, respectively (Figures 
4 and 5B). These structural similarities suggest that the 
pocket of AlbC could accommodate the aminoacyl 
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Figure 5. Comparison of the AlbC and TyrRS M; pockets. (A) The tyrosine-binding pocket of TyrRS^, in complex with its L-tyrosine substrate. 
TyrRS residues and L-tyrosine are represented in ball and stick, and coloured in blue and orange, respectively, with oxygen and nitrogen atoms 
coloured in red and dark blue, respectively. (B) Superimposition of the TyrRS residues (A) with the corresponding AlbC residues. AlbC residues are 
represented in ball and stick, and coloured in green, with oxygen and nitrogen atoms coloured in red and dark blue, respectively. Residues lining the 
two pockets superimpose with an rmsd value of 1 .49 A over 32 main chain atoms. (C) The AlbC pocket in the presence of the phenylalanyl moiety of 
a Phe-tRNA phe substrate. The phenylalanyl moiety position is mimicked on that of the L-tyrosine in the pocket of TyrRSj^ as shown in (A) and (B), 
except that the hydroxyl group has been removed. The conserved residue S37 and the residue L200, which have no corresponding residues in TyrRS, 
are shown in pink, with oxygen and nitrogen atoms coloured in red and dark blue, respectively. (D and E) Site-directed mutagenesis study of AlbC. 
Cyclodipeptide-synthesizing activities are shown with error bars. The corresponding western blots indicating amounts of the proteins are also shown. 
(D) cFL-synthesizing activities of the wild-type AlbC (in blue) and of each of the variants in which a residue conserved among CDPSs is substituted 
(in grey). (E) cFL- and cYL-synthesizing activities (in blue and yellow, respectively) of the wild-type AlbC and the variant L200N. The wild-type 
AlbC synthesizes 33.5 ± 2.5mgl~ 1 of cFL and 4.9 ± 0.5mgl~ of cYL (Supplementary Figure S6). 



moiety of an aa-tRNA substrate in the same way that 
TyrRSs bind their tyrosine substrates. To test this possi- 
bility, we independently substituted each residue in AlbC 
suspected to interact with the aminoacyl moiety of the 
substrate by alanine and, where relevant, other amino 
acids. We produced the resulting variants in E. coli 
and compared their in vivo synthesis of cFL, the major 
cyclodipeptide produced by AlbC, to that of the 
wild-type enzyme (1). The substitution of Y178 with 
alanine resulted in an almost inactive variant (Y178A), 
and its substitution with phenylalanine gave a poorly 
active variant (Y178F), despite being produced in 
amounts similar to that of the wild-type enzyme (Figure 
5D). The variants E182A and E182Q were inactive, and 
the variant E182D had a weak but detectable activity 
although its production was strongly affected. The 
variant G35A was poorly active despite being abundantly 



produced (Figure 5D). Thus, the hydroxyl group of Y178 
and the carboxylate group of El 82 are essential for AlbC 
activity, suggesting that these groups may mediate 
hydrogen bonds with the main chain amino group of the 
substrate, in a way similar to what is described for 
TyrRSs. The conserved residue G35 is also essential for 
the accommodation of the CDPS substrate. The conserved 
residues Y32 and D158 in TyrRS^ form hydrogen 
bonds with the hydroxyl group of the tyrosine substrate 
(Figure 5A), and superimpose with the hydrophobic 
residues L33 and L185, respectively, in AlbC (Figure 
5B). This is consistent with hydrophobic phenylalanyl 
or leucyl moieties in the AlbC substrates. To try to 
accommodate a tyrosyl-tRNA Tyr substrate in the AlbC 
pocket, we attempted to construct an L33Y/L185D 
AlbC variant, but the genetic construct was not expressed 
(Figure 5D). 
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All these data are consistent with the pocket of AlbC 
being the binding site of the aminoacyl moiety of an 
aa-tRNA, but do not prove it. We therefore replaced a 
residue in the AlbC pocket suspected to interact with the 
aminoacyl moiety of the substrate to see if this may 
modify the enzyme's specificity. The sequence alignment 
of CDPSs (Figure 4), and the specificity determined for 
each characterized CDPS (1), led us to focus on Rv2275 
from M. tuberculosis that mainly synthesizes cyclo(L-Tyr- 
L-Tyr) (cYY) (1,3). The residue L200 located near the 
bottom of the AlbC pocket (Figure 5C) corresponds to 
an asparagine in Rv2275 (Figure 4). This asparagine 
residue may form a hydrogen bond with the hydroxyl 
group of the tyrosyl moiety of a tyrosyl-tRNA Tyr sub- 
strate. We constructed the variant L200N, and tested its 
in vivo specificity: it synthesized mainly cYL instead of 
cFL and thus showed some of the specificity of Rv2275 
(Figure 5E, Supplementary Figure S6). This indicates that 
the substitution of one residue located in the AlbC pocket 
is sufficient to change the nature of one of the aminoacyl 
residue incorporated into the cyclodipeptides it produces. 
This also demonstrates that the AlbC pocket is the 
binding site of the phenylalanyl moiety of the Phe- 
tRNA he substrate. But, this result does not provide 
data on the binding of the leucyl moiety of the other 
substrate, which is required to synthesize a molecule 
of cFL. 

Formation of a covalent intermediate during the catalytic 
cycle of AlbC 

The superimposition of the pockets of AlbC and TyrRS^ 
revealed significant similarities (see above), but also 
intriguing differences. The strictly conserved residue S37 
of AlbC has no corresponding conserved residue in class 
Ic-aaRSs (Figures 4 and 5B). The position of the 
phenylalanyl moiety of a Phe-tRNA in the AlbC 
pocket, as predicted by analogy with the tyrosine substrate 
in the TyrRS^/ pocket (Figure 5C), indicates that the Oy 
atom of S3 7 would be located in the proximity of the 
carbonyl carbon of the esterified Phe-tRNA phe substrate. 
This organization together with the catalysed reaction — 
the formation of a peptide bond from aminoacyl-tRNA — 
suggests that S37 may be a catalytic residue serving as a 
nucleophile. This is consistent with the fact that substitu- 
tion of S37 with alanine resulted in an inactive variant 
(S37A), but its substitution with the nucleophilic 
cysteine (S37C) did not completely abolish the enzymatic 
activity (Figure 5D). So, S37 may attack the carbonyl 
carbon of the esterified Phe-tRNA phe substrate, leading 
to the covalent phenylalanylation of S37. The formation 
of a covalent intermediate in which L-Tyr is transferred 
from Tyr-tRNA Tyr to an active serine (S88) was also sug- 
gested for Rv2275, using radioactive labelling (12). 
Moreover, the S88 residue of Rv2275 was suggested to 
be activated by the conserved Y253 residue, which corres- 
ponds to Y202 in AlbC (12). To verify the existence of a 
covalent intermediate, we further analysed AlbC and the 
five variants S37A, S37C, Y178F, E182Q and Y202F. 
The enzymes were incubated in the presence 
of [ 3 H]Phe-tRNA phe , and they were separated by 



WT S37C E182Q S37A Y202F Y178F 



Figure 6. Covalent labelling of AlbC and variants by tritiated Phe 
transferred from [ 3 H]Phe-tRNA phe . Enzymes were incubated with 
[ 3 H]Phe-tRNA phe , as described in 'Materials and Methods' section, 
separated on SDS-PAGE, then transferred onto a PVDF membrane 
that was analysed with a radioimager. 



SDS-PAGE before being transferred on a PVDF 
membrane, which was analysed with a radioimager. As 
observed for Rv2275, wild-type AlbC retained a high 
amount of the radiolabel, whereas the variant S37A did 
not retain any radiolabel. The variant S37C presented be- 
haviour similar to that of the wild-type enzyme, confirm- 
ing the formation of a covalent intermediate and the 
probable status of S37 as the point of covalent attachment 
(Figure 6). The variants Y178F and E182Q retained sig- 
nificant amounts of the radiolabel although much less 
than the wild-type enzyme (Figure 6). This result, 
combined with the substantial and complete loss of 
enzyme activity for the variants Y178F and E182Q, re- 
spectively (Figure 5D), argues that Y178 and E182 may 
play a crucial role in the good positioning of the 
aminoacyl moiety of the substrate in the active site — 
essential for an efficient formation of the covalent inter- 
mediate — concerning the variant Y202F, it retained 
~11% of the enzymatic activity (Figure 5D), and 
at least as much radiolabel as the wild-type enzyme 
(Figure 6). As the removal of the hydroxyl group in the 
variant Y202F does not prevent the formation of the 
covalent intermediate, this demonstrated that Y202 is 
not responsible for the S37 activation in AlbC. 

In order to unambiguously identify the residue acylated 
and the nature of the acylation, we performed peptide 
mass fingerprint (PMF) analyses on trypsin-digested 
AlbC, previously incubated or not with Phe-tRNA phe . 
AlbC without substrate incubation gave two identified 
fragments containing the residue S37 (fragments [29^46] 
and [27^16]) (Supplementary Figure S7A). The corres- 
ponding m/z of these fragments (calculated m/z 1878.94 
and 2163.09, respectively) were selected and isolated as 
precursor ion for PSD MS/MS sequence characterization 
(Supplementary Figure S7B and C). The same experiment 
performed on AlbC incubated with Phe-tRNA he gave 
two additional fragments (Figure 7A). The m/z of these 
two fragments were 2026.01 (observed m/z 2026.00) and 
2310.16 (observed m/z 2310.17), and corresponded to the 
values expected for the addition of a phenylalanyl moiety 
(addition of 147.07) on fragments [29-46] and [27-46], 
respectively. These fragments, sequenced unambiguously 
by both b- and y-ions series, contained a phenylalanyl 
moiety on S37 (Figure 7B and Supplementary Figure 
S8). This demonstrated the formation of a covalent inter- 
mediate, in which L-Phe is transferred from Phe-tRNA phe 
to S37 during the catalytic cycle. Moreover, the same ex- 
periment was performed on relevant AlbC variants. No 
modified fragments were observed for the variant S37A 
(Supplementary Figure S9). For the variants E182Q and 
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Figure 7. Identification of the covalent aminoacyl-enzyme intermediate. (A) Peptide mass fingerprint (PMF) analysis of trypsin-digested AlbC 
pre-incubated with Phe-tRNA phe . Arrows indicate the formation of two additional fragments compared to PMF analysis of trypsin-digested 
AlbC not incubated with Phe-tRNA phe (Supplementary Figure S7): one at m/z 2026.00 (continuous line frame) corresponding to phenylalanylated 
fragment [29-46] at m/z 1878.94, and the other at m/z 2310.17 (dashed line frame) corresponding to phenylalanylated fragment [27^46] at m/z 
2163.08. (B) PSD-MS/MS spectrum of m/z 2026.00. Both b- and y-ions series identify the amino acids sequence of the modified AlbC fragment 
[29^46], and the residue 37 as the phenylalanylated residue. PSD-MS/MS spectrum of m/z 2310.17 is shown in Supplementary Figure S8. 



Y178F, phenylalanylated fragments could be significantly 
detected, but in much less amounts than those obtained 
with the wild-type enzyme (Supplementary Figures S10 
and Sll), in agreement with radiolabelling experiments 
(Figure 6). These fragments were formed with amounts 
insufficient for PSD-MS/MS analyses (Supplementary 
Figures S10 and Sll). For the variant Y202F, the 
phenylalanylated fragments were observed in proportions 
similar to those obtained with the wild-type enzyme, and 
were unambiguously sequenced (Supplementary Figure 
SI 2). The results confirm that Y202 is not responsible 
for S37 activation. For the variant S37C, the 
phenylalanylation of the fragments [27^19] and [29^16] 
(respectively, calculated m/z of 2040.97 and 2326.13 
instead of 1894.90 and 2179.06) was detected (Supple- 
mentary Figure S13A). MS/MS sequencing of both m/z 
demonstrated the formation of a thioester link between 
the phenylalanyl moiety and the thiol group of C37 
(Supplementary Figure S13B and C). All these data 
allow the proposal of a mechanism for the formation of 
a covalent intermediate during the catalytic cycle of the 
CDPS AlbC (see Discussion section). 



Distribution and role of the residues conserved among 
CDPSs but located outside the active site 

Seven of the 13 residues conserved among CDPSs are 
outside the active site: H31, W54, F59, G79, S120, Y128 
and CI 49 (Figure 4 and Supplementary Figure SI 4). Each 
of G79, S120 and C149 was substituted with alanine 
without any significant effect on the activity or production 
of the corresponding variants amounts (Figure 5D). 
Variants H31A, W54A and F59A expressed very substan- 
tially lower activities than the wild-type enzyme, and were 
produced in significantly (H31A and F59A) or substan- 
tially (W54A) smaller amounts (Figure 5D). The residues 
H31, W54 and F59 are located in strand P3, helix a2 and 
loop a2-p4, respectively. The residue H31 is fully buried, 
and its side chain makes two hydrogen bonds with the 
residues D63 and SI 98 that are located in neighbouring 
strands. The residues W54 and F59 are highly buried and 
their side chains interact with hydrophobic residues from 
strand (36 (Supplementary Figure S14A and B). The sub- 
stitution of each of these three residues with alanine 
should lead to a significant structural destabilization, 
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which is consistent with our results showing that the pro- 
duction of the corresponding variants is strongly affected. 
The substitution of Y 128 by alanine did not impair the 
production of the corresponding variant, but abolished 
cFL synthesis (Figure 5D). The residue Y128, which is 
located at the beginning of helix a5, is fully buried and 
its side chain makes two hydrogen bonds with Q123 from 
helix r|l and R132 from helix a5 (Supplementary Figure 
S14A and C). The substitution of Y128 by alanine should 
alter the relative positions of these helices and neighbour- 
ing interactions, resulting in a loss of AlbC activity. 

Recognition of the tRNA moiety of the aa-tRNA 
substrates by AlbC 

The electrostatic potential surface of AlbC presents a 
highly biased distribution of charged residues, which 
could interact with the tRNA substrate by forming salt 
bridges with the phosphates of the tRNA backbone or 
hydrogen bonds with the nucleotide bases. In particular, 
there is a large patch of o positively charged residues 
covering a surface of 1040 A~ (Figure 8A). This patch is 
mostly composed of residues belonging to the helix a4, 
which contains nine basic residues. These residues, R87, 
K90, R91, K94, R97, R98, R99, R101 and R102, protrude 
toward the solvent, and a similar pattern of basic residues 
is found in all CDPSs (Figures 4 and 8B). We substituted 
with alanine each of the basic residues in helix oc4. All the 
resulting variants were produced with yields similar to that 
of the wild-type enzyme but their cFL-synthesizing 
activities were diverse (Figure 8C). The variants R98A, 
R99A and R98A/R99A displayed only 19, 8 and <2% 
of the wild-type activity, respectively. Five other variants 
(K90A, K94A, R97A, R101A and R102A) had activities 
that were 53-72% of the wild-type enzyme. The activities 
of the two remaining variants, R87A and R91A, were 
similar to that of the wild type. These results showed 
that the basic residues located in the C-terminal half of 
the helix oc4, especially R98 and R99, are important for 
cFL production, probably because they interact with the 
tRNA moiety of the substrate. A second patch of basic 
residues, consisting of R214, R215 and R220 from loop 
oc8— f37 and R231 from loop (37-a9, is present on AlbC 
(Figure 8A). We substituted these residues and showed 
that the corresponding variants displayed properties 
similar to the wild-type enzyme (Figure 8C), indicating 
that this patch is not involved in tRNA binding. This is 
consistent with the fact that this patch is not conserved 
among CDPSs (Figure 4). Our results suggest that the 
interaction between AlbC and its aa-tRNA is driven by 
the binding of the substrate aminoacyl moiety in its deep 
pocket and the interaction of the tRNA moiety with at 
least one patch of basic residues on helix oc4. 

DISCUSSION 

We report the 3D structure of AlbC, a member of the 
recently identified CDPS family. AlbC presents a 
compact a/p structure with a Rossmann-fold domain, as 
it was also recently reported for the CDPS Rv2275 (12). 
We show that, although having many structural 



similarities, AlbC and Rv2275 have significant differences 
that raise intriguing questions regarding the catalytic 
mechanism used by the CDPSs to synthesize 
cyclodipep tides. 

AlbC, and Rv2275, share a high structural similarity 
with the catalytic domain of class-Ic aaRSs. aaRSs 
catalyse specific esterifications of particular amino acids 
to their cognate tRNAs in a two-step reaction. First, 
amino acids are activated with ATP to form enzyme- 
bound aminoacyl-adenylate intermediates. Second, the 
aminoacyl moieties are transferred to tRNAs to form 
aa-tRNA products. The structural similarities concern 
only the catalytic domains of class-I aaRSs, which are re- 
sponsible for the binding of amino acid and ATP sub- 
strates. The amino acid-binding pockets of class-I aaRSs 
superimpose well with the AlbC pocket and we show that 
this pocket interacts with the aminoacyl moiety of the 
aminoacyl-tRNA substrate. However, AlbC does not 
possess the HIGH motif involved in ATP binding that is 
conserved among all catalytic domains of class-I aa-RSs 
(36-38). Also, AlbC does not have the loop connecting the 
catalytic domain to the C-terminal tRNA-binding 
domain, which contains the second ATP-binding motif 
— KMSKS — in class-I aa-RSs. This is consistent with 
the fact that CDPSs do not activate amino acids using 
ATP, but use already activated amino acids in the forms 
of aa-tRNAs (1). Another significant difference is that 
AlbC does not possess the C-terminal domain of aaRSs, 
which is crucial for their interaction with tRNAs (39). 

The AlbC specificity seems to be directed mainly at the 
aminoacyl moiety, and not at the sequence of the tRNA 
moiety of the substrate. Indeed, the substitution of one 
residue of the aminoacyl-binding pocket of AlbC by the 
corresponding residue of Rv2275 gave a variant that, in 
E. coli, mainly bound tyrosyl-tRNA Tyr rather than 
phenylalanyl-tRNA phe . Comparing the acceptor stems of 
the two tRNA moieties revealed that they share the se- 
quences of the single-strand extremity (A 76 C 75 C 74 A 73 ) and 
the first base pair (C 72 -G'), suggesting that some specifi- 
city determinants could lay in these conserved sequences 
of the tRNA moiety of the substrate. We previously 
showed that AlbC is able to efficiently use various 
E. coli aa-tRNAs to synthesize various cyclodipeptides 
(1). We compared the sequences of all E. coli aa-tRNAs 
that can serve as substrates for AlbC (predominantly 
phenylalanyl- and leucyl-tRNAs, and to a lesser extent 
methionyl-, tyrosyl-, alanyl- and valyl-tRNAs). For all 
these tRNA families, we found molecules that also share 
the same sequence: A 76 C 75 C 74 A 73 C 72 -G'. This might 
suggest that sequences of the single-strand extremity and 
the first base pair, but not those of the double-stranded 
acceptor stem, are specifically recognized by AlbC. Noted 
that aa-tRNA protein transferases recognize the aa-tRNA 
in a tRNA sequence-independent manner, the double- 
stranded acceptor stem being fully dispensable (40). 

An interesting question concerns the tRNA-binding 
mode of AlbC. Could it be similar to that described for 
class-Ic aaRSs? Recognition of tRNAs by TyrRSs and 
TrpRSs mainly involves their C-terminal tRNA-binding 
domains, essential for tRNA binding (39). However, 
several residues belonging to helices a7 and a9 of their 
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Figure 8. Basic patch residues contributing to CDPS activity. (A) Electrostatic surface potential of AlbC, mapped on its solvent-accessible surface at 
contouring levels of ±5 kTe-1. Positive charge is in blue, negative charge in red. The potential was calculated using APBS in PyMol (59) (http:// 
www.pymol.org/) from a derived model of AlbC where the missing side chain of residue R215 and the missing residue T217 were added. AlbC 
structure is shown in grey and the side chains of all mutated basic residues are identified as sticks. (B) Electrostatic surface potential of Rv2275 
generated like that of AlbC. (C) Site-directed mutagenesis study of AlbC. cFL-synthesizing activity of the wild-type AlbC (in blue) and of each of the 
variants in which a basic residue is substituted (in grey) are shown with error bars. The corresponding western blots indicating amounts of the 
proteins are also shown. Left, the residues belonging to the helix os4; right, other basic residues. 



N-terminal catalytic domains interact with the acceptor 
stems of tRNAs (Figure 4) (29,32,41-46). AlbC does not 
possess a C-terminal tRNA-binding domain like those in 
aaRSs nor any residue identical to those of the N-terminal 
catalytic domain involved in the tRNA interaction (Figure 
4). Otherwise, we showed that a patch of basic residues, 
which is conserved among CDPSs but not present in 
class-I aa-RSs, is important for AlbC activity. This 
patch may interact with the phosphate backbone of the 
tRNA body, as proposed for aa-tRNA-protein transfer- 
ases and Fem aa-transferases (47). Comparing the 
position of the basic patch in AlbC with that of the 
acceptor stem-binding region in TyrRS^/ revealed that 
they are different (Supplementary Figure SI 5). All these 
data might suggest that the tRNA-binding mode of AlbC 
is different from that of class-I aa-RSs, but this issue 
remains to be addressed. 



We wondered how AlbC uses two tRNA substrates to 
form cyclodipeptides. One possibility is that AlbC acts as 
a homodimer, each monomer interacting with one 
aa-tRNA. Although AlbC was obtained as a monomer 
both in solution (1) and in crystal forms, we cannot 
exclude that aa-tRNA binding induces dimerization. 
Noted that class-Ic aaRSs act as functional dimers, but 
they are also obtained as dimers in crystals (44). Class-Ic 
aaRSs dimers are mainly formed by the association of a 
hydrophobic surface involving residues of the CP1 domain 
of each catalytic domain (34,35). The positions of these 
residues are not conserved in AlbC. In addition, when 
we mimicked this association for two AlbC monomers 
by superimposing them on different crystal structures of 
TyrRSs or TrpRSs, we observed major steric hindrances 
between helices a5, a6 and a7 of each AlbC monomer. 
Therefore, it is unlikely that AlbC dimerizes in a manner 
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Figure 9. Proposed mechanism of covalent phenylalanyl-enzyme formation for AlbC. (A) Proposed anchoring of the Phe-tRNA e substrate at the 
AlbC active site. The residues S37, Y178 and E182, which are essential for the covalent intermediate formation (Figures 6 and 7), are represented in 
green. The residue L200, shown to interact with the phenylalanyl moiety of the substrate (Figure 5E), is also represented in green. As no residue in 
the AlbC active site is likely to deprotonate the hydroxyl group of S37, the S37 activation is proposed to be achieved by a concerted proton shuttling 
mechanism involving the two adjacent vicinal hydroxyls of the nucleotide A 76 of the tRNA moiety. (B) The resulting covalent phenylalanyl-enzyme. 



similar to class-Ic aaRSs, and furthermore, any such di- 
merization would result in a physical separation of the two 
AlbC catalytic pockets, inconsistent with the formation of 
two peptide bonds. This is indeed what was observed for 
Rv2275, which is found as a homodimer: the residues that 
make up the interface belong to other secondary structural 
elements than those involved in the dimerization of 
class-Ic aaRSs, and the two Rv2275 catalytic pockets are 
not at the dimer interface, but are physically separated 
(12). Moreover, it is unlikely that AlbC dimerizes in a 
manner similar to Rv2275 since the residues that are 
involved in the Rv2275 interface are not conserved in 
AlbC and other CDPSs. As it seems more probable that 
AlbC is active as a monomer, we considered whether AlbC 
catalyses its two-substrate reaction using a ternary 
complex or a ping-pong mechanism. The size of the 
AlbC pocket does not appear appropriate to accommo- 
date the aminoacyl moieties of two aa-tRNA substrates 
concomitantly. Thus, one aa-tRNA may bind to AlbC 
and its aminoacyl moiety would be stored as a covalent 
intermediate, i.e. as an aminoacyl-enzyme intermediate. 
The formation of a covalent intermediate resulting from 
the transfer of L-Tyr from Tyr-tRNA Tyr to an active serine 
was suggested for Rv2275, but such a transfer was not 
conclusively demonstrated (12). We demonstrated that 
this transfer occurs in AlbC because we showed that 
the active serine residue is covalently bound to the 
phenylalanyl transferred from Phe-tRNA phe during the 
catalytic cycle. An intriguing point that remains to be 
determined concerns the activation of the catalytic 
serine, i.e. the deprotonation of its hydroxyl group 
serving as the nucleophile. For Rv2275, it was suggested 
that the catalytic serine (S88) residue is activated by a 
conserved tyrosine residue (Y253). This is consistent 
with the hydrogen bonding of Y253 to the side chain 



hydroxyl group of S88, and the much lower (by a factor 
of 200) catalytic activity of the Y253F variant (12). For 
AlbC, the corresponding tyrosine residue (Y202) adopts a 
different conformation in the AlbC structure, and the 
Y202F variant displays only a 10-fold decrease in catalytic 
activity. We also unambiguously showed that the forma- 
tion of the covalent aminoacyl intermediate is not affected 
by the substitution of Y202 with phenylalanine, 
demonstrating that Y202 is not responsible for the activa- 
tion of S37 in AlbC. As no other residue in the vicinity of 
S37 can account for its activation, an attractive hypothesis 
is that the serine activation results from a concerted 
proton shuttling mechanism involving the two adjacent 
vicinal hydroxyls of the nucleotide A 76 of the tRNA 
moiety (Figure 9), as it has been proposed for the riboso- 
mal peptididyl transferase (48) and the FemX aa- 
transferase of Weissella viridescens (49). At this stage, we 
cannot exclude some rearrangements in AlbC active site 
during the different steps of the reaction. The analysis of 
B-factors around AlbC active site suggests that the loop 
(33-oe2, which precedes the position S37, may present some 
flexibility. The crystal structure of AlbC complexed 
with different intermediates of the reaction should 
clarify this point. But, whatever how the serine activation 
is performed, we demonstrated that CDPSs use a ping- 
pong mechanism involving a covalent aminoacyl-enzyme 
intermediate. The aminoacyl-enzyme may then react 
with the aminoacyl moiety of the second aa-tRNA to 
form a dipeptidyl-enzyme or dipeptidyl-tRNA intermedi- 
ate, which may then undergo intramolecular cyclization 
leading to the formation of the cyclodipeptide product. 

In conclusion, we report the crystal structure of AlbC, a 
CDPS from S. noursei. It confirms that CDPSs are struc- 
turally similar to the catalytic domain of class-I aaRSs. 
This structural similarity has provided clues about the 
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molecular bases of the interactions between CDPSs and 
their aa-tRNAs substrates. Moreover, we demonstrate 
that CDPSs catalyse their two-substrate reactions via a 
ping-pong mechanism involving a covalent aminoacyl- 
enzyme intermediate. CDPSs provide a novel example 
of the exaptation of aaRSs in processes not related to 
ribosomal protein synthesis. Some atypical aaRSs and 
aaRS-like proteins have already been shown to be 
involved in amino acid biosynthesis, RNA modification 
(50,51), and in non-ribosomal peptide synthesis (52,53). 
These proteins include enzymes having the catalytic 
domain of an aaRS and lacking its tRNA-binding 
domain. Thus, the archaeal Asn synthetase, a paralog of 
AsnRS, does not form Asn-tRNA , but synthesizes Asn 
by amidation of the P-carboxyl group of aspartate with 
ammonia (54). YadB (renamed glutamyl-queuosine 
tRNA Asp synthetase), a paralog of GluRS, catalyses 
glutamylation of queuosine in the wobble position of 
tRNA Asp (55,56). Amino acid:[carrier protein] ligases, 
which are SerRS homologues, catalyse the ATP- 
dependent activation of particular amino acids and their 
transfer to the phosphopantetheine prosthetic group of 
carrier proteins (57). Finally, CDPSs differ from these 
other aaRS-like proteins in that they have not only 
diverged from conventional aaRSs, but they have also 
acquired new active site residues, converting them into 
self-contained cyclodipeptide-forming enzymes, which 
can use aa-tRNAs as substrates. 
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